TIGHT MARKOV CHAINS AND RANDOM COMPOSITIONS 
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O ■ Ohio State University 

p^ 1 Abstract. For an ergodic Markov chain {X{t)} on N, with a stationary distri- 

bution TT , let Tn > denote a hitting time for [n]^, and let X„ = X(T„). 
Around 2005 Guy Louchard popularized a conjecture that, for n — >• oo , Tn is almost 
Geometric(p), p = 7r([n]'^), Xn is almost stationarily distributed on [n]'^ , and that 
Xn and Tn are almost independent, if p{n) := supj p(i, [n]'^) — > exponentially fast. 
For the chains with p{n) —> however slowly, and with supj ■ \\p{i, •) —p{j, ■)\\tv < 
^^^ 1 , we show that Louchard's conjecture is indeed true even for the hits of an ar- 

v^^ ' bitrary Sn C N with 7r(S'n) — > 0. More precisely, a sequence of k consecutive 

C J . hit locations paired with the time elapsed since a previous hit (for the first hit, 

since the starting moment) is approximated, within a total variation distance of or- 
der k supjp(i, Sn) , by a fc-long sequence of independent copies of {(.n,tn) , where 
Cu I In = Geometric (7r(5'ra)) , tn is distributed stationarily on Sn , and In is indepen- 

dent of tn ■ The two conditions are easily met by the Markov chains that arose in 
Louchard's studies as likely sharp approximations of two random compositions of a 
large integer i/ , a column-convex animal (cca) composition and a Carlitz (C) compo- 
sition. We show that this approximation is indeed very sharp for each of the random 
compositions, read from left to right, for as long as the sum of the remaining parts 
stays above In f . Combining the two approximations, a composition - by its chain, 
and, for Sn = M"^ , the sequence of hit locations paired each with a time elapsed from 
the previous hit - by the independent copies of {In, tn) , enables us to determine the 
limiting distributions of /i = o{lni/) and /i = o(i/^") largest parts of the random 
cca composition and the random C-composition, respectively. (Submitted to Annals 
of Probability in August, 2009.) 
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1. Introduction. Consider a Markov chain X{t) on N. Given 5* C N, let T{S) 
be the hitting time, i.e. T{S) = mm{t > : X{t) e S}. Keilson [14] proved that 
if a state i is positive-recurrent, and a nested sequence Si ^ S2 ^ • • • is such that 
i ^ Si and Ei[T{Sn)] -^ 00, then 

The basic idea of the proof was that the probabihty of hitting 5*^ between two 
consecutive returns to i is smah, of order l/Ei[T{Sn)], and so T{Sn) is roughly 
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2 BORIS PITTEL 

the sum of the geometrically distributed number of i.i.d. times between those returns 
to i. 

If a chain is ergodic, with a stationary distribution n , the condition Ei[T{Sn)] — ^ 
(X) is met if (and only if) 7r(S'„) := X^ies "^(0 ~^ 0- Indeed, by Derman's theorem 
[9] (see Durrett [10], Ch. 5), the expected number of visits to Sn between two 
returns to i is iT{Sn)/'n'{i) ■ So the probability of hitting Sn between two returns 
to i is 7r(S'„)/7r(i) at most, whence Ei[T{Sn)] > 7r{i)/7r{Sn) ■ 

Aldous [1] estimated accuracy of the exponential approximation of the hitting 
time for a finite-state ergodic Markov chain, when an initial state is chosen at 
random, in accordance with the stationary distribution vr . Roughly, the discrepancy 
is small if the expected hitting time far exceeds a relaxation time r = maxj min{t : 
||p*(i,-) — 7r(-)||Ty < p}, P < 1/2. r "measures the time taken for the chain to 
approach stationarity" in a sense that maxi ||p*(i, •) — 7r(-)||Ti/ < (2p)l-*/'^J . 

Precisely because these results are so strikingly general, more subtle questions 
remain open. Is there a geometrically distributed random variable close to T(S) in 
terms of the total variation distance? What is, asymptotically, the joint distribu- 
tion of the hitting time T{S) and the hit location X(T{S))? Is there an explicit 
convergence rate in terms of the total variation distance? Are X(T{S)) and T{S) 
almost independent? How does one describe asymptotic behavior of the first k 
visits to the rare set S , ii k = k{S) is not too large? 

For an ergodic Markov chain {X{t)} on N, with a stationary distribution vr, let 
r„ > denote a hitting time for [n]" = N\ [n] , and let X„ = X(T„) . Around 2005 
Guy Louchard [18] popularized the following conjecture. If p{n) := sup^ p{i, [n]*^) = 
0{q^), q < 1, then r„ is almost Geometric(p), {p = vr([n]'^)), Xn is almost 
stationarily distributed on [nY , and X„ and Tn are almost independent. The 
Markov chains with p{n) = 0{q^) arose in the studies of two random compositions, 
Louchard [19], [20] and Louchard, Prodinger [21] as possibly sharp approximations 
of those random compositions. Louchard's thought-provoking idea was that if the 
conjecture and approximability of each random compositions by a chain would be 
proved, potentially one could obtain the limiting distributions, marginal and joint, 
of extreme- valued parts and, possibly, of other related characteristics of the random 
compositions. 

In this paper we introduce a class of Markov chains that contains the chains from 
[19]-[21] for which we can give full answers to the questions posed above and, in 
particular, fully confirm Guy Louchard's conjecture. We also prove that the chains 
in [19]-[21] indeed provide a good approximation of the random compositions. The 
two approximations made in tandem lead to the asymptotic distributions of the 
extreme-valued parts of the compositions, together with the convergence rates. 

Let us give a more specific description of our results. 

Definition 1.1. An ergodic Markov chain on N, with a transition probability 
matrix P = {p(j, A;)}^^^^^ and a stationary distribution vr, is called tight if the 
family of row probability measures {p(i, OlieN is tight, i.e. 



(1.2) lim sup > p{i,k) = 0. 

n— >oo ^ ^ — ^ 
k>n 

For a tight P, we will prove that if 7^ 5„ C N is such that 7r(5'„) -^ 0, then 
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uniformly for all initial states i , 

(1-3) E,[T''{S^)]r^^-—, k>l, 

SO Ei[T{Sn)] ~ 7r~^(5^) in particular. Thus all the moments of T{Sn)/Ei[T{Sn)] 
converge, uniformly over i, to the moments of the exponential random variable, 
which implies convergence in distribution as well. As for the hit location X{T{Sn)) , 
given Un Q Sn, 



(1.4) lim 



n— )-oo 



P,{X{T{Sn)) € U^} ""^^^^ 



Tr{S„ 



0, 



uniformly for i € N. Thus, marginally, T{Sn) and X(T{Sn)) behave in the limit 
as if X (t) is a Bernoulli sequence with each trial outcome having distribution vr . 

Now suppose that, besides being tight, the chain meets a condition 

(1.5) So := .inf V p{i, k)p{j, k) > 0. 

i,7eN ^ — ' 

fc6N 

For a tight chain, this condition is equivalent to 

Po := sup \\p{i,-) -p{J,-)\\tv < 1, 

which implies that 

\\p''{i,-)-M\TV<Po- 

(So, for the relaxation time r in [1], we have r = [ln2/ln(l//9o)] .) 

Given a random vector Y with integer components, we denote its probability 
distribution by d(Y) . Under the conditions (1.2) and (1.5), we show that, uniformly 
for the initial state z G N, 

(1.6) \\d{{X{T{Sr,),T{Sn)) - d{{in,t^))\\Tv = 0{p{Sn)), p{S) :=supp(fe,5), 

fceN 

where in and tn are independent, 

P{t„ = r}=7r(5„)(l-^(5„))^-i, r>l, 

vr(i„) 

More generally, the A; -long sequence of chronologically ordered locations of first k 
hits of Sn , each paired with the time elapsed since a preceding hit (paired with 
T{Sn) in the case of the first hit) is approximated by the A; -long sequence of inde- 
pendent copies of {in,tn), withiu the total variation distance of order 0{kp{Sn)) ■ 
(Aldous and Brown [2], [3] had used Stein's method to show that, for a station- 
ary, continuous-time, reversible Markov process, the hitting times for a subset A 
of states after prolonged excursions outside of A form an approximately Poisson 
process.) 
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The equation (1.6) yields, rather directly, the limiting distributions of the ex- 
treme values for {X{t)}i<t<N ■ Given ^u, let X^f^' denote the /x-th largest among 
X{1),...,X{N). Then 

(1.7) P,{X(^) <n} = PJPoisson (A^7r(5„)) < fi} + 0{fj,^/N + iV/(5„)), 

and we have an extended version of (1.7) for the joint distribution of X^^\ . . . , X^'^' . 

Turn now to the application of these results to the random compositions studied 
in [19]-[21]. 

A composition of a positive integer i/ is y = (yi,... ,y^), f^ < i^ , such that 
yi, ■ ■ ■ ,y^ are positive integers satisfying 

(1.8) Y.y^ = ''- 

i=l 

Since, for each /i, there are {'^Zi) compositions, we have 2'^~^ compositions overall. 
Assuming that a solution of (1.8) is chosen uniformly at random {uar) we have a 
random composition Y of z^ , its dimension M being random as well. It is known, 
Andrews [4], that 

T> 

(1.9) Y = {Zi,. . . ,Zm-i-,Zm), 

(= meaning equality of distributions), where Zi, Z2, ■ ■ ■ are independent Geomet- 
ries with success probability 1/2, 

(1.10) M = min{m : Zi H h Z™ > J^}, 

and 

M-l 

(1.11) Zm := i^ - Y. ^-r 

Hitczenko and Savage [12] used this connection to the well studied success runs 
in a fair coin-tossing process as an efficient tool for asymptotic analysis of various 
characteristics of the random composition. 

If a random composition Y is not uniformly distributed on the set (1.8), one can 
only hope for asymptotic independence of most of the parts. Lowering expectations 
then, one may search for a Markov chain that approximates the behavior of Y in 
question; ergodicity of such a chain would mean near independence of parts It^ 
and lt2 with \ti — ^2! sufficiently large. 

Here are two examples of such random compositions. A column-convex-animal 
(cca) composition of z^ is a collection of lengths of an ordered sequence of contiguous 
columns on Z^ , whose total sum is u , such that every two successive columns have 
a common boundary consisting of at least one vertical edge of Z^ , Klarner [15], 
Privman and Forgacs [22], Privman and Svrakic [23], Louchard [19], [20]. A Carlitz 
(C) composition meets a condition that no two adjacent parts coincide, Carlitz 
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[8], Knopfmacher and Prodinger [16], Louchard and Prodinger [21], Hitczenko and 
Louchard [11]. 

One obtains a certain, nonuniform, distribution on the set of solutions of (1.8), if 
a column-convex animal is chosen uar from among all such creatures. One obtains 
another nonuniform distribution, if a composition of ly is chosen uar from among all 
C-compositions. We call these objects a random cca composition Y, and a random 
C-composition Y , and denote the random number of components of Y by M . For 
both schemes, Louchard [19], [20] and Louchard and Prodinger [21], determined a 
limiting joint distribution of two successive parts, Yj and ^t+i , in the case when t 
and M — t are of order i^ , and also the limiting distribution, vri , of the first (last) 
part Yi (Ym)- These results strongly suggest, though do not actually prove that, 
in both cases, 

(1.12) p{i,k):= lim P{Yt+i = k\Yt = i}, ii,keN), 

might well be the transition probabilities of a Markov chain, with an initial distribu- 
tion vTi , that closely approximates the whole random composition. We fully confirm 
this conjecture, proving an approximational counterpart of (1.9)-(1.11). The chains 
turn out to be tight, exponentially mixing, and this enables us to use our results 
for asymptotic analysis of extreme- valued parts of both random compositions. Let 
y(M) denote the /i-th largest part of the random composition in question. For the 
random cca composition, we show that, for ^ = o(lnz/) , 

/ N ln(u~-'^z/ln u) , , 

(1.13) yW.J|i__l + 0,(l). 

where z* = 0.31 ... is the smallest-modulus root of 

4^3 - Tz^ + 5z - 1 = 0. 
For the random C-composition, if ^u = oiy^''^) then 



:i.i4) r« = ^S^i^ + o.(i). 



ln(l/z*) 



where z* = 0.57 ... is the smallest-modulus root of 

1 = 0. 



Sr 



(Op(l) stands for a random variable bounded in probability.) It follows from (1.13) 
and (1.14) that the number of distinct values among X^^', . . . , X^-^' is likely to be 
at most (1 -|- o(l)) ln^/ln(l/z^,) , for the corresponding z^, , for ^u = o(lniv) and 
/i = o{y^l'^^ respectively. It can be shown that, in fact, the range is asymptotic 
to ln/i/ln(l/2;*) , in probability. (See Hitczenko and Louchard [11] regarding a 
limiting distribution of a "distinctness" (range size) of the random C-composition.) 

We plan to extend this approach to other constrained compositions, such as quite 
general Carlitz-type compositions studied by Bender and Canfield [6]. 
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The rest of the paper is organized as follows. In Section 2 we show that, for the 
tight Markov chains {X{t)} , the hitting time of a rare set Sn , i.e. with 7r(S'n) — )■ 0, 
scaled by 7r~^(S'„) converges, with all its moments, to the exponentially distributed 
random variable of unit mean, while the hit location has, in the limit, a stationary 
distribution restricted to Sn ■ And convergence is uniform over all initial states. In 
Section 3 we add a second condition that guarantees exponential mixing, calling 
such chains tight, exponentially mixing (t.e.m.) chains. Significantly sharpening 
the results of Section 2, we demonstrate that that the hitting time and the hit 
location are asymptotic, with respect to the total variation distance, to a pair of 
independent random variables, one being geometrically distributed with success 
probability 7r(S'„), and another having the restricted stationary distribution. The 
error term is 0{p{Sn)), see (1.6) for definition of p{-) ■ We extend this result 
to the first k hits of Sn , and then state and prove the claims about the limiting 
distribution of the fi largest values among X{1), . . . , X{N) , useful for fj, = o{N^'^) . 
In Section 4 we apply these claims to the extreme-valued parts of two random 
compositions of a large i^ , the cca composition, and the C-composition. Specifically, 
in Section 4.1 we briefly survey the basic known facts about the compositions. In 
Section 4.2 we show that each composition is sharply approximated, in terms of 
total variation distance, by a related Markov chain, for as long as the current sum 
of parts does not exceed i^ — In ;/. In Section 4.3, for each composition, we derive 
the limiting distributions of the /U largest values of a random composition parts, 
assuming that /i = o(lnz/) for the cca composition, and /x = o{v^''^) for the C- 
composition. In Appendix we prove an auxiliary result on large deviations of the 
number of parts in each of the random compositions. 

2. Tight Markov chains. Consider an ergodic Markov chain X{t) on N 
with the stationary distribution tt = {7r(j)}jgN. Given S* C N, we denote tt{S) = 
^. ^7r(j). Introduce T{S) the positive hitting time of S", i. e. T{S) = min{t > 
0|X(t) € S}, and the hit location X{T{S)). Our focus is on a rare S, i. e. with 
a small 7r(5') . 

Assuming that the chain satisfies a tightness condition (1), namely 

(2.1) lim sup 7 p(i, A;) = 0, 

n— >-oo ^ ^ — ^ 
k>n 

we will show that, uniformly for an initial state in N, (1) T{S) is asymptotically 
exponential, with mean tt~^{S) , and (2) the distribution of X(T{S)) is asymptotic 
to {7:{s)/7:{S)}ses- 

As a first step we prove the following. 

Lemma 2.1. Let a possibly infinite Sn ^ 9 be such that lim„_>.oo vr(5„) = 0. 
Under the condition (2.1), 

(2.2) E,[T{Sn)]^^^^, n^oo, 

uniformly for i E N . 

Note. Consider a simple asymmetric random walk on N, i. e. the Markov chain 
with p(l, 1) = q, p(l,2) = p, and p{i,i — l) = q, p{i,i + l) = p ioic i >2. For p < q 
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this chain is ergodic, with the stationary distribution 7r(j) = (1 — p/q){p/qy~^ ■, 
but it is clearly not tight. For i = 1, T{{n + 1}) = T{{n + l,n + 2, . . . }) , but 
7r({n + l}) 7^7r({n + l,n + 2,...}). So (2.2) cannot hold for aU S'„ with 7r(S'„) ^-0. 
In fact, the expected common hitting time for these two sets is not asymptotic to 
the reciprocal of either of these stationary probabilities. 

Proof of Lemma 2.1. By tightness condition (2.1), there exists K such that 



Y,p{i,j)> 1/2, Vi>l. 



Then, for t>\. 



1 

2* 



P.{T{[K]) >t}<-^ EAT{[K])] < 2. 



Now, one (possibly not the shortest) way of hitting S'„ , starting at i, is to hit the 
set [K] and from there to hit Sn ■ By the strong Markov property, conditionally 
on X{T{[K])) = j, {j G [K]), the residual travel time f(5„) tih hitting 5„ is 
distributed as T{Sn) under Pj . So 

E[f{S^)\X{T{[K])) = j] = E,[T{Sr.)], j G [K]. 

Then, introducing i G [K] such that 

E,[T{Sn)] = Ui8.xEj[T{Sr.)], 

we have: 

E,[T{S^)]<E,[T[K]]+ J2 P^{X{n[K]))=j}E,[T{S^)] 

(2.3) je[K] 

<2 + E^[T{Sn)]; 

in particular, sup^ Ei[T{Sn)] < cxo. 
By Markov property, 

(2.4) EjinSn)] = 1 + J] p{j, k)Ek[T{S^)], j G N. 

fees- 
Multiplying both sides of (2.4) by 7r(j) and summing for j G N, we get 

jeN fee5^ jeN 

= l+Yl ^ik)Ek[T{Sn)], 
fees- 

as 7r(-) is stationary. So, as both series converge, 

(2.5) Y. <k)Ek[T{S^)] = 1. 
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(We note that (2.5) is a special case of a well-known result, due to Kac [13], with 
inevitably harder proof, for a general discrete-time stationary process; see also 
Breiman [7], Section 6.9.) Then, by (2.3), 

(2.6) E,[T{S^)] + 1 > —1— =^ E,[T{S^)] > ^ 



Now, given a state k , we have 

(2.7) Ee[T{Sn)] < Ee[T{{k})] + Ek[T{Sn)], 

T{{k}) being the hitting time for the singleton {k} . Combining (2.6) and (2.7), 
we obtain: for every fixed k , 

(2.8) Ek[T{Sn)^ > -^ 



" vr(S'„) 
Picking arbitrary L, by (2.4), we have: for n > n{L) , 

Ej[T{Sn)] > 1 + J] pU, k)Ek[T{Sn)], j € N. 



k<L 



Therefore, by (2.^ 



liminf ( inf Ej[T(Sn)] I 7r(5'„) > liminf inf > p(j,k), 

n—>-oo VjeN J n—>-oo jgN ^ — ' 



where, by (2.1), the RHS approaches 1 as L t oo. So 
(2.9) E,[T{S^)]> ^ 



vr(S'„)' 



uniformly for j G N. 
It remains to show that 



EAnsn)]< ^ 



7r(5„)' 
uniformly for j € N. Using (2.4)-(2.5), we obtain then 

(2.10) Yl ^^ 1 1 + E pu^k)E,[ns. 



j€S„ ^^^ "■ ^ \ k&S- ) ^ "^ 

Suppose that there exists a subsequence iim -^ oo and S > 0, such that 

lim E4TiSn)]TT{Sn)>l + S. 

n€{nm} 

Then, by (2.7), 

lim Ek[T{S„)]7riS^) > 1 + 5, 

ne{nm} 
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for every fixed k. Picking M > and dropping the summands for A; > M in 
(2.10), we get then: for n = rim large enough, 

(1 + «/2)E^S7tIE*,*)|<1. 



ies„ ^ ^ ' \k<M 



E 7r(i 
This is impossible if M is chosen so large that 

inf ^ p{j, k) > J— 



Therefore 



and so, invoking (2.3), 



- , S/3' 

k<M ' 



Etinsn)] < ^ 



vr(S'„)' 



(2.11) Ek[T{S^)]<^ 

uniformly for A; G N. 

Combining (2.9) and (2.11), we complete the proof of Lemma 2.1. D 

The fact that Ei\T{Sn)\ — > oo already implies, via Keilson's theorem [14], that, 
for each fixed initial state i, T{Sn) / Ei\T{Sn)] is, in the limit, exponentially dis- 
tributed, with parameter 1 . The tightness condition allowed us to estimate the 
scaling parameters Ei[r{Sn)] asymptotically, uniformly for i G N. Interestingly, 
this uniformity can be used for a simple alternative proof of asymptotic exponen- 
tialityof r(5„)/^,[T(5„)]. 

Lemma 2.2. Under the condition (2.1), for each fixed k>l, 

(2.12) E,[T\Sn)]^k\/T,\Sn), 
uniformly for i G N. Consequently, uniformly for i G N, 

(2.13) P^{T{Sn)^^{Sn)>x]^e-\ Vx>0. 

Proof of Lemma 2.2. Introduce the moment generating functions 
As formal power series, these functions satisfy 



(2.14) </),(n) = e^\Y. P^^^^) + E P(^>J>i W h ^ ^ N. 

U6S„ je5= 
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Differentiating both sides of (2.14) k times at u = we get 

E,[T\Sr^)] =bii,k) + Y^ pii,j)E,[T''iSn)], i G N, 
(2.15) fc_i 



r=l ^^"^ j€SS. 



For fc = 1 we get (2.4). Let k >2, and suppose that, for r < k, 
(2.16) E,[T^{S,,)] = (1 + 0(1)) ,_i' . E^mSr.)], 

uniformly for i G N. (This is obviously true for k = 2.) Then 

r=\ 

(using (2.4)) 

=i+(i+o(i))x; i^l; . (i^an^n)] - 1) 



6(^, fc) = 1 + (1 + o(l)) Y, ,1, ; , j; p(z, j)i^, [T(5„)] 

r=i "" ^"^"^ J' ess 



uniformly for i E N. Using (2.17), we rewrite (2.15) as 

EAT\s^)] = (1 + o(i)) ,_^; + j; K^, j)i?,[r'=(5j], 

"^ ^^"^ .es= 
uniformly for z E N. Now if define x^{i, k) = b(i, k) , and, for t > 0, 



x*+ 



i(i,A;) = 6(i,/c)+ J]p(i,j)x*(j,A;), i G N, 
jess 



then x*(j, /c) \ Ei\T^{Sn)\ , i € N. In particular, for /c = 1 , we have 6(i, 1) = 1 , and 
x*(i,l) t Ei[T{Sn)]. Using this observation and (2.17), and Ei[T{Sn)\ ~ l/vr(s„) 
uniformly for z € N, we conclude: 

u\ 

EAT\Sr,)] = (1 + 0(1)) fc_i;^ . E^Sn)], 

uniformly for z G N. Thus (2.16) holds for all r > 1 , and so 

u\ 
E,[T\S^)] = {l + o{l))—— fc>l, 

uniformly for z G N. 

Since limsup/c~^(A;!)^''^//c < oo, the exponential distribution is the only one 
with the moments A;!, (Durrett [10]). The proof of Lemma 2.2 is complete. D 

Turn now to Hn := X(T{Sn)) , H reminding us that X{T{Sn)) is the hit loca- 
tion. 
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Lemma 2.3. Let Un Q Sn- Uniformly for i G N, 



(2.1J 



lim 

n— >oo 



P^{H^ G [/„} 



7r(5'n) 



Proof of Lemma 2.3. By Markov property, 

(2.19) P,{F„ e f/„} =p(«,t/„) + Y. P{hJ)Pj{Hn G ^7n}, « G N, 

where we use the notation p{i, A) = Ylik^APi^-- ^) •> A'^^. 

(a) Assuming only that {p(i, k)} is ergodic, let us show that, for all fixed i,j G N, 



(2.20) 



lim |P,{F„ G t/„} - P,{F„ G t/„}| = 0. 



By Cantor diagonalization device, any subsequence {rtm} of 1,2,... contains a 
further subsequence {?^m^} such that for n —?■ oo along this subsequence, there 
exists 

/, = lim P,{H^ G Un} G [0, 1], i G N 



The limits fi may well depend on {n^}, of course. Letting n = n^i 
(2.19), we obtain: 



oo m 



(2.21) 






Since the matrix {pii,j)} is ergodic, fi does not depend on i, (Durrett [10], Exer. 
3.9). So (2.20) follows. 

(b) By tightness, given e G (0, 1) , there exists J = J(e) such that 

Yp{i,j)>l-e, ViGN. 

For n > n{J), [J] C S^. So, by (2.19), 

inf P,{Hn G [/„} > (1 - e) minP,{F„ G ?7„}, 

iGN i<J 



and 



So 



lim sup 



sup Pi{Hn G Un} <e + maxPi{Hn G C/„}. 

iGN *<-^ 



supP,{i^„ G Un} - inf P,{i?„ G [/„} 

.iGN »GN 

< 2e + lim \maxPi{Hn G C/n} - min Pi{Hn G t/„}l = 2e. 

n— >oo i< J i< J 



Thus 



(2.22) 



lim 



sup Pi{Hn G ?7„} - inf P,{F„ G t/„} 

,iGN *eN 



0. 
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(c) Multiplying both sides of (2.19) by 7r(i), summing for i € N, and using 
stationarity of 7r(-) , we obtain 

Y,<i)P^{Hn G C/n} = J] 7r(i)p(i, [/„) + J] Pj{H^ e t/n} J]^Wp(^'J) 

= 7r([/„) + J] 7T{j)P,{H^ G [/„}, 

J ess 

so that 

(2.23) Yl ^^)PdHn € t/„} = Ti{Un). 



Consequently 



inf P^{H^ e [/„} < g(n,t/„) < sup Pi{Hn G [/„}, 



where 

q{n,Un) :- 



vr(S'n) 
Combining (2.22) and the double inequality we conclude that 



lim 

n— >oo 



Pi{Hn G C/n} 



0, 



vr(5'„) 
uniformly for i G N. The proof of Lemma 2.3 is complete. D 

Thus, considered separately, T(5'„) and X{T{Sn)) asymptotically behave as if 
X{t) is a Bernoulli sequence with each trial outcome having distribution it. Of 
course, the Bernoulli sequence possesses finer properties; in particular, T{Sn) and 
X{T{Sn)) are independent of each other. We are about to impose an additional 
condition on {p{i,k)}. It will be used to to establish a limit distribution of the 
vector (T{Sn),X{T{Sn))) , together with a convergence rate in terms of the || • \\tv 
distance. In particular, under the two conditions, T(Sn)) and X{T{Sn)) turn out 
to be asymptotically independent. 

3. Tight, exponentially mixing Markov chains. The extra condition (2) is: 

(3.1) p:= sup J]|p(i,A;)-p(j,/c)|<2. 

(Of course, p<2 always.) Then (Durrett [10], Exer. 5.11), 

Y^\p-{i,k)-p-{j,k)\<2{p/2r, 

fceN 

where p^{-,-) are the n-step transition probabilities. Consequently, multiplying by 
7r(j) and summing over j G N, 

5]b"(i,fc)-vr(fe)|<2(p/2)". 
ken 
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Equivalently, denoting e = ({1}^^^)^, 

(3.2) ||(P" _e7r)^|U,(M) = \\P^ - enh^^^) < 2(p/2)". 

We call the chains meeting (3.2) exponentially mixing, and use abbreviation t. e. 
m. chains for tight, exponentially mixing Markov chains. Now 

^ Y^ \p{i, k) - p{j, k)\ = min P{X + y}, 

where minimum is over all random vectors {X, Y) such that P{X = k} = p{i, k) , 
P{Y = k} = p{j,k), fc € N, see Durrett [10]. Therefore, selecting independent X 
and Y , 

\ Y. \p{hk)-Pi3M <l-P{X = Y} = l- Yp{i,k)p{j,k). 

fcGN fceN 

Hence the condition (3.1) is met if 

(3.3) 5o:=.mf Vp(z,A:)p(j,A;)>0, 

fceN 
in which case p/2 < 1 — 5o- In fact, for the tight chains the converse is true: (3.1) 
implies (3.3). Suppose not. Then there exists {{ir-,jr)}r>i such that 

(3.4) lim V'p(v,/c)p(jV,A;) = 0. 

r— >oo ^ — ' 

fceN 
By the tightness condition, we may assume that p{ir, •) and p{jr, •) converge, 
weakly, to some probability distributions, pi and p2 respectively, that is 

p{ir,k) ^Pi{k), p{jr,k) ^P2{k), fceN. 
Combining this with (3.1) and (3.4), we obtain 

J]|pi(A:)-p2(A:)|<2, J]pi(A:)p2(A:) = 0. 
fceN fceN 

This is impossible, since the second condition implies that 

\Piik) -P2ik)\ =Pi{k) +P2{k). 

Theorem 3.1. Let dn,i denote the joint distribution of X{T{Sn)) and T{Sn) for 
an initial state i E N. Let dn denote the product probability measure on Sn x N, 
such that 

d^{A xB) = ^^ ■ V 7r(5„)(l - ^(5n))"-\ A C 5„, i? C N. 
Under the conditions (1) and (2), uniformly for i G N, 

(3.5) \\dn,i - dnWxV = 0{p{Sn)), 

where 

(3.6) p(S'„) :=supp(z,S'„). 

ieN 

Proof of Theorem 3.1. Introduce 

£n = supp(z,S'„), p(i,yl) := ^^^(i,/;;); 

by the tightness and lim7r(S'n) = 0, we have lime„ = 0. Let P„ = {p(i, /i;)}i,feg5c . 
As a first step let us prove the following claim. 
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Lemma 3.2. For n large enough, Pn has an eigenvalue A„ G [1 — e„, 1) and a 
corresponding eigenvector f„ = ({/n(0}i6S=) ; such that 

(3.7) 1 < Ui) < J- , 

1 - (6/do)en 

with 6o coming from the condition (3.3). 

Proof of Lemma 3.2. Given ?ti > , introduce Sn^m = Sn^ I'm + 1, ?ti + 2, . . . } , 
so that S*^ ^ = S*^ n [m], which is a finite set. Denote Pn,m = {p{hk)}i,kes^ ^■ 
By the conditions (2.1) and (3.3), there exist uq and nio such that 

(3.8) inf min p{i,S^^) > 1-60/3, 

(3.9) inf min Y] p{i,k)p{j,k) > So/2. 

Let n > no, m > mo- CaU ^ ^ C S"^,^ closed (in S'^nj), if p{i,S^„i \ A) = 
for each i ^ A. Call a closed set minimal if it does not contain a closed subset. 
The condition (3.9) clearly ensures that there exists exactly one minimal closed 
subset A, which may be the whole set S'^^. A submatrix Pa '.= {p{i,k)}i^keA 
is irreducible; so it has a positive eigenvalue \{A) with a positive eigenvector f^ , 
and the absolute values of the remaining eigenvalues of Pa do not exceed A(^) . 
In fact, those absolute values are strictly less than \{A) . Otherwise, by Frobenius 
theorem, there exists a partition A = l+l^^^^r , h > 1, such that, for r = 1, . . . ,h, 
{k e A\3i e Ar, p{i,k) > 0} = Ar+i, (/i + l := 1). So, ior i £ Ai, j £ A2, 

^ p{i, k)p{j, A;) = ^ p{i, k)p{j, k) = ^ p{i, k)p{j, k) = 0. 
fcei fegA fceA2nA3 

And this contradicts (3.9). Furthermore, by (3.8), 

X{A) > mmp{i,A) = mmp{i,S^^) > 1 - (5o/3, 

while, denoting A' = S!^ j^\ A and using (3.9), 
uiduxpii, A') < 1 — minpU, A) 

ieA' iGA' 

<1— min > p(i,k)p(i,k) 

~ i€A',j€A^^^ '^ ' 

fceA 

<1— min > p(i,k)p(j,k) < I — 60/2. 



n.m 

ict— r, 

rL,m 



Therefore A(^) is strictly larger than X{A') , the largest eigenvalue of Pa' ■ Denot- 
ing Pa',a = {pii,k)}ieA',keA, let f^' be a solution of 

PA',AfA + PA'fA'=XiA)iA'. 



TIGHT CHAINS 



15 



Since Pa',a^ > and A(^) > A(^'), f^/ exists uniquely and is positive. The 
combined vector in,m = [^a-,^A') is a unique, positive, eigenvector of Pn,m for its 
largest eigenvalue 

K,^ = \{A) > 1 - 6o/3. 

Let us bound maxjf„^m(0/™i'^i fn,m(0 • Introduce Piir) , {i G S!^j^,t > 0), 
the probability that, starting at state i, the Markov process X{t) stays in S"^ ^ 
for all t < T . The sequence p(t) = {pi{T)}i^s^ „ , satisfies a recurrence 

p(T + l)=P5c_p(r), p(0) = (l,...,ir. 
Moreover, there exists Cn,m > such that 

(3.10) p{t) rsj Cn^rnK,m^n,ra, T ^ OO. 

To exploit this connection, let us first use a coupling device to derive a recurrence 
for the differences Pi(r) — Pj{t) , i ^ j ■ Consider two independent processes, X{t) 
and Y{t) , starting at i and j in 5*^ ^ . Introduce the events C/(t) = {X{t) G S*^ ^} 
and V{t) = {Y{t) G 5"^,^}, and let 1(1^) denote the indicator of an event W . By 
the Markov property. 



Pi{r + 1) - Pj{t + 1) = -E(i,j) 



n i(f/(i))- n i(^w) 



f<T+l 



t<r + l 



^ p{i,ki)p{i,k2)E(^kuk2) 



n 1(^(0) -ni(^w) 



t<r 



«T 



+ X] p{i,h)p{i,k2)Ek, 



U^iUit)) 



t<T 



y^ p{i,ki)p{i,k2)Ek 

kies„,m, fe2e>s= 



n i(^w) 

^ p(i,fei)p(i,/c2)[Pfei(T) -Pfc2(r)] 

'2es^,m 

■17^ '=2 

+ P(i,5'n,m) ^ p(i,A;i)pfci(r) - p{i,Sn,m) ^ P{j,k2)pkA'^) 



fci,fc2eSn,m 

fcl#'s2 



fc2e5£; 



Letting r t oo and using (3.10) we obtain 

(3.11) \n,ra{fn,m{i) - fn,ra{j)) = ^ P{i,ki)p{i ,k2){fn,m{kl) - fn,ra{k2)) 



«l,K2tan,m 



+ P{i,Sn,m) ^ P{hki)fn,m{kl) - p{i, Sn,ra) ^ P(j, ^2)/n,m(^2) 



fcie5- 



fc2es= 
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Let fn,ra{ii) = Hiaxig^c^^ fn,mii), fn,m.{i2) = miiiigsc^ fn,m{i)- Then it follows 
from (3.11) that 



(/n,m(«l) - /n,m(«2)) K,m. - ^ P(n , fel)p(«2, ^2) < P{i2, Sn,m) fn 



[H 






Here, see (3.3), 

^ p{ii,ki)p{i2,k2) < I- ^ p(zi,fe)p(j,/c) < l-(5o/2. 






and 



P{i2, Sn,m) < £n,m ■= maX p{i, Sn,m)- 



As Xn,m > 1 — <^o/3, we obtain 

(3.12) fn,m{h) ( 1 - Y£n,m j < /n,m(i2), Vn > Uq, Vm > mQ. 

Now 

< en,m - en < SUp ^ p(i, /c), 

so that liniTO^oo eji,m = ^n, uniformly for n, and lim„_j.oo e„ = 0. So there exist 
ni > no, mi > mo such that en,m < 5o/7 for n > ni , m > mi . For those n,m, 
the relation (3.12), with fn,m{i2) = miuj fn,m{i) = 1, yields 

(3.13) l</n,n^(i)< ^ .„ /. . , i ^ S'^^^. 

A standard argument shows then existence of a subsequence m^ t oo such that (1) 
for each i G S*^ , there exists fnii) = linims^oo fn,msi''') ) which necessarily satisfies 

1 - (b/dojen 

and (2) there exists A„ = \\va.rns^ooXn,ms £ [1 ~ En,!)- Clearly then f„ := 
({/n(i)}igsc ) G Loo (5*^) is an eigenvector of P„ , A„ being a corresponding eigen- 
value. The proof of Lemma 3.2 is complete. D 

Let Fn be a diagonal S^ x S*^ matrix with Fn{i,i) = fn{i) , i & S!^. Define a 
S^ X Sn matrix 

Q„ = X-'F-'PnFr, = K'{{Ui))-^p{hk)Uk)},,kesi. 

rp 

Let e„ = ({l}ig5c) . Since F„e„ = f„ , we have 

O p — A~^F~^P f — F~^f — p 

^n<^n — ^n ^ ri ^ n '-n — ^ ri i^ — ^n? 
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SO that Qn is stochastic. From tightness of P = {p{i,k)}i^k&i-, and (3.9) it follows 
that, for each fixed n, and even uniformly over n, Qn is tight as well, i. e. 

5{K):= sup ^ g„(i,A:)^0, K \ oo. 
Now from 

{QnYil^k) = Y, Qn{i,jKQnr-\j,k), 

by induction on v it follows that 

sup^ ^ {Qnni,k)<d{K). 

Hence, given n, the rows of all matrices {QnY form a tight set of probability dis- 
tributions. Therefore there exists z^., — t- cx) and a family of probability distributions 
7r„(i,-) on S*^, (i G S*^), such that, for i,k £ S^, 

(QnVihk) ^ Trn{i,k), z/ -^ oo. 

In addition, by (3.7) and (3.9), for i,j € S!^, 

Yl Qnii,k)QnU,k)=X-' Y 4^Pii^k)p{j,k) 

> (1 - [6/60)6^^60/2 > 60/3, 

for n large enough. Therefore, cf. (3.2), 

(3.14) Y \iQnnhk)-{QnnJ,k)\< 2(1 -60/Sr^O, u-^^. 
kesi 

Letting z/ — )■ co along {vs} in (3-14) we obtain that the family {7r„(i, •)}ig5c 
consists of a single probability distribution 7r„(-) on S*^ . Thus, for any distribution 
q{-) on S*^ , qiQnY" —^ T^n ■ Applying this to g = 7r„ , and then to q = UnQn , 

7r„<5„ = lim 7rn{QnV"Qn= lim (7r„(5„)((5„)'''' =7r„, 

that is Tin is a stationary distribution of Q„ . Using stationarity of 7r„ and (3.14) 
we obtain 

(3.15) II [{QnY - e„7r„]^||^^(^,^ = ||(Q„)^ - e„7r„||^^(^,^ < 2(1 - (5o/3)^ 

cf. (3.2). Since {QnY = ^n" ^n^Pn^n, Combination of (3.3) and (3.2) implies that 

(3.16) (PnY = K ^nO-n + Rn,u, (^n ■= {tT^ (i) //„(?) jig^c , 

where 

(3-17) l|-^n,i.|L,(5c) = \\Rn,u\\L^iS-J < 2(1 - 60/3)". 
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The estimates (3.16)-(3.17) enable us to determine the hmiting joint distribution 
of T{Sn) and X{T{Sn)) ■ Given AQ Sn, and z with \z\ < 1, define 

and 'tp{z) = [{^i(^)}ig5<;] • Using Markov property, we have 

tpiiz) = zp{i,A) + z "^ p{i,k)'4)k{z), i G S"^, 

or 

rp 

tp{z) = zpn + zPnipiz), p„ := [{p{i, A)}i^s-„] . 
Therefore, introducing the S*^ x S*^ identity matrix /„ and using (3.16)-(3.17), 

iP{z)=z{In-zPn)-'pn = zY,z''iPnrPn 

(3.18) =- — fn(cr„p„) +R„(z); 

i — ZAn 



R„(z) := z'^ Z" Rn,:,Pr. 



u>0 

By (3.17) and 

\\Pn\\L^.{s-j <P{A) ■.= supp{i,A), 

iGN 

we have: each component of Rn(-2) is analytic for \z\ < (1 — 5o/3)~^ , and 



"*-''"'-"'?■> - i-N(i-V3)- 



Therefore each il^i{z) initially defined in the unit disk admits a meromorphic 
extension to the open disk of radius (1 — 6o/3)~^ > 1, with a single, simple pole 
z = 1/Xn in that disk. 

As for the explicit term in (3.18), 

[U^nPn)], = CiA}Uiz), CiA) := Y^ ar,iJ)p{j,A). 

In particular, setting z = 1, 

PdXinSr.)) € A} = M^) = £iSMl + o{piA)). 

Since Pi{X(T{Sn)) G Sn} = 1, we then obtain 

(3.19) i_An = £(^i)M), e.{n) = 0{p{Sn)). 

Therefore 

PdXinSn)) €A} = ^1^ + OipiSn)), i G S'n. 
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This uniform estimate and (2.23), with Un = A^, easily imply that 

(3.20) ^ = ^+0(p(5„)). 
Furthermore, given a positive integer r, 

PdnSr.) = r, XiTiSr,)) eA} = [z-]^,(z) = -L ^ ^ dz, 

where L is a circular contour |z| = 1. By (3.18), the extended ipi{z) has a unique 
singularity, a simple pole, in a ring between L and Li , which is the circular contour 
of radius (1 — (5o/4)~^ . Using (3.18) and the residue theorem, we obtain 

= C{A^)Ui)Xr' + 0((1 - 6o/4yp{A)) 
Thus, by (3.19) and (3.20), 

(3.21) PdnSn) = r, X{T{Sn)) € ^} = (1 - A„)Ar'4^ 

+ o[(i - Xr,)xrMSn)] + o[(i - 5o/4ypiSn))]. 

In particular, for A = Sn, 

(3.22) P.{r(5„) = r} = (1 - A„)Ar' 

+ 0[(1 - A„)Ary5n)] + 0[(1 - 5o/4)X5„))]. 

Now we had proved already that, under the tightness only, 

so that 

E,[{T{S„))k] ~ A;!^-'=(5„). 

According to (3.22), we also have 

E,[{T{S^)),] = ^Y^AJfc + 0{p{S^){l - AO-'^). 

Comparing the two formulas we see that 1 — An ~ 7r(S'„) . In fact, we can say more. 
From (3.22) it follows that, uniformly for z G 5^, 

1 — An 

Combining this with (2.10), we get 

(3.23) l-Xn = TT{Sn){l + 0{p{Sn))). 
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The rest is short. Let C C S'n x N, and Cr = {k (£ Sn : (/c,r) € C}. From (3.21) 
it follows that, uniformly for i G 5*^ and C , 



(3.24) P,{{XiTiSr,)),TiSn)) £C} = J](l - A„)A; 
And, by (3.23), 



-i^(a 



T& 



tt{S„ 



+ 0{p{Sn)). 






t6F 



tt{S„ 



< |(l-An)-7r(5'n)| ^x^~^[l + r(l-x)] — TTTT) (^^ between An and l-7r(5'n)) 



reN 



7r(S'„ 



<|(l-A„)-7r(5„)| 



2(1 -x)Y^ Tx^-^ + 2 XI ^^~^ 



r>l 



r>l 



1(1 - AJ - TT{Sr,)\ ■ 4(1 - X)-' = 0(p(5J). 



So (3.24) becomes 

\\dn,i-dn\\TV = 0{p{Sn)), i & S^. 

Suppose that i G Sn- Then 

(3.25) P,{(X(T(5„)),T(5„))gC} 

= ^p{i,k)Pk{{X{T{Sr,)),T{Sn) + l)eC} + 0{p{Sn)), 



kes- 



where, by (3.24), 

Pk{{x{T{Sn)),T{Sn) + i)ec} = J](i - a„)a; 



-i<C. 



T+1, 



tGN 



TT{Sn 



0{p{Sn)) 



J](l-A„)Ar'^ + 0(p(5„)) = J](l-A„)Ar'^+0(l-A„+p(5„)) 



T>2 






r>2 

.-1 ^(g) 

vr(S'„) 



+ 0(l-A„+p(5„)) 



tGN 



vr(S'„) 



Therefore, by (3.25), (3.24) holds for i € S'n as well. This completes the proof of 
Theorem 3.1. D 

Let Tn^r be the time intervals between consecutive visits to Sn- So T„^i = 
T{Sn) , and, for r > 1 , 

Tn,r = min{t > Tn,r-1 ■ X (t) € Sn} - T„,r-1, 

'n,r — 1 • / ^ ^ n,k-i 
k<.r 
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i. e. Tn,r is the time of r-th visit to Sn- Let Xn^r = X(Tn,r), i- e- -^n,r is a 
state in Sn visited at time Tn,r- Introduce a random sequence {ir',tr}r>i, where 
all ii,ti,i2,t2, ■ ■ ■ are independent and, for each r , 

vr(A„j 

while tr is distributed geometrically, with success probability 7r(5'„). Also, for two 
random vectors, Y and Z, of a common dimension v , let dTy(Y,Z) denote the 
total variation distance between the distributions of Y and Z, i. e. 

dTv(y;Z)= sup \P{Y e B} - P{Z e B)\. 

Since |x| is convex, 

0.5 sup |i?[/(Y)]-£;[/(Z)]|<dTy(Y;Z) 

/:||/IIloo(n'')<1 

< sup \E{f{Y)\-E{f(Z)\\. 

Theorem 3.1 implies the following. 
Theorem 3.3. Uniformly for an initial state z G N, 

(3.26) dTv{{Xn,r', Tn^r}l<r<k', {^r ; ^r }l<r<fc) = 0{kp{Sn)), 

Thus, if k = k{n) is such that kp{Sn) — ?• 0, the random, sequence {Xn,r', 
Tn,r}i<r<k is asymptotic, with respect to the total variation distance, to the 
Bernoulli sequence {ir,tr}i<r<k ■ 

Proof of Theorem 3.3. We prove (3.26) by induction on k. For /c = 1, it is 

the statement of Theorem 3.1. Assume (3.26) holds for some k > 1. Let / : 
p|fc+i xN'^+i ^M have ||/||l<»(n'«+ixN'=+i) < 1- Denote 

X = {Xn,r}l<r<k + 1, ^ = {Xn,r}l<r<k, 

(3.27) Y = {Tn,r - 6{i, r)},<r<k+i, Y^ = {r„,, - 5{i, r)},<r<k, 

X^*^) = {Xr}l<r<k, y^^^ = {yr}l<r<fc- 

We write first 



(3.28) E,[f{X;Y)] 

= £;,[^4/((x('^),X„,fc+i) ; (YW,r„,fc+i - 1)) I (X^'^), Y^'^))]]. 

By the strong Markov property, 

(3.29) Ei[/((X<'>,X„,t+,) ; (Y<''->,T„,t+, - 1)) | (X'", Y<'>)] 
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here X, T are the location and the time of the first hit of Sn for the chain starting 
at Xk & Sn- Using (3.26) for A; = 1 , we have 

(3.30) £;.J/((x«,X),(y«,r-l))] -i?[/((xW,^),(yW,t))] = 0(p(5J), 



uniformly for j £ Sn- (Here (£, t) = {£r,tr) •) So, introducing / : N^ x N^ — > IR by 

/(x('^),y(^)) = i?[/((x«,£),(y«,t)) 
and using (3.28)-(3.30), we have 

(3.31) |£;4/(X,Y)]-i^4/(X«,Y«)]|=0(p(5„)). 
Besides, applying the inductive hypothesis to / , we also have 

(3.32) £;i[/(X«, Y«)] - E[f{{£r,tr}l<r<k)] = 0{kp{Sn)). 

It follows from Fubini theorem and (3.31)-(3.32), that 

E,[f{y.,Y)\ - E[f{{l,,trh<r<k + l))\ =i?.[/(X,Y)] - E[f{{U,tr}l<r<k))\ 

= Oiik + l)piSn)), 

which proves the inductive step. So (3.26) holds for all k. D 

Let us apply Theorem 3.3 to the extreme values for the t.e.m. chains. Given 
a large N, let X'^^^ = X^^'^'> denote the j'-th largest among X{1), . . . ,X{N); in 
particular, X^^> = maxi<t<Ar X(t) . From now on we will use a notation Sn = 
{n + l,n + 2,...}. 

Corollary 3.4. Uniformly for i = X{0) , 

(3.33) P,{X(^) <n}= P{Poisson {N7r{Sn)) < fi} + 0{fi^ /N + Np^{Sn))- 

Proof of Corollary 3.4. X^^^ < n iff during [l,N] the chain visited Sn at most 
/i — 1 times. So, by Theorem 3.3, 



(3.34) P,{X(^) <n} = Y, (^.)^'{Sn){l - 7T{Sn)f--' 



+ 0{MSn))- 



Here 
and 



(iV),-(l - 7r(5„))-^' = N^{1 + Oiii^N) + OifipiSn))), 



(1 _ niSn))'' - e-^"(^") < 27Ve-^"(^") (g— C^-) _ (l _ vr(5„))) 

<iV7r2(5„)e-^-(^")). 
So, as 7r(S'„) < p{Sn) , (3.34) becomes (3.33). D 

Corollary (3.4) is a special case of the following result. Given a < 6 < oo, denote 
Sa,b = (a, b] , i. e. Sa,b = Sa\Sb- Let 14, b = VN,a,b denote the number of visits to 
Sa,b during [l,iV], and Xa,b = >^N,a,b = Nn{S{a,b)). 
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Theorem 3.5. Let (ai, bi], . . . {ak,bk] be disjoint. Uniformly for i = X{0) , 



(3.35) 



Pi {Va,,bi < f^i} \ = JJ P{Poisson {Xa,,bJ ^ A*4 

l<£<k J l<e<k 

where ^ = ^i + ■ ■ ■ + fik , a = min^ a£ . Thus, if ^ = o{N^^'^) and Np'^{Sa) = 
o{l) , the numbers of visits to non- overlapping intervals (a^jfc^] are asymptotically 
independent Poissons with parameters NTT{Sai,be) ■ 

Proof of Theorem 3.5. Applying Theorem 3.3 to 5 := [J Sai,bt , 

KKk 



I i^'^i- I -■ ^ 'J.i;. .. ■,jk<fJ.k ^ '^ l<e<k 

1- Y. ^(•5a.,bj) +0(M'5a)), 



l<£<fc 

where J = ji + • • • + J^ , and 

N \ m 



\Ji,---JkJ ji!---Jfc!(^-j)!' 
The rest runs parallel with the proof of Corollarry 3.4. □ 

Analogously we obtain a relatively simple asymptotic formula for the joint dis- 
tribution of x(i),...,x(^). 

Theorem 3.6. Let oo = no > ni > 71-2 > • • • > n^ . Uniformly for i = X{0) , 
(3.36) pA fl {xW<n,} 

Yl llP{Poisson{X^^,^^_,) = i^r} + 0{fi^/N + Np\Sn,)). 



1/1 .....l^u 



More generally, let 

B <Z {x = (xi,... ,x^) e N : xi > • • • > x^}. 

Given x, /et yi(x) > ••• > ym{^) denote all the distinct values (range) of the 
sequence xi, . . . ,x^, and let aj = aj(x) > be the multiplicity of yj = yj{:>i.) . So 
m = m{x) < fi, and ai + • • • + Om = ^. Then, denoting n{B) = inf x^ , and 

setting y^ = 00 , 

(3.37) Pi{(x(i),...,x(^)) es} 

xG-B l<r<m '" 

x€-B l<r<m 
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where a{y) = 7r(y)/7r([y„, oo)) , y € [y^, oo) . 

In the next section we will describe two models of a random constrained com- 
position, and show that each random composition is sharply approximated by a 
t.e.m. chain. It will enable us to use Corollary 3.4 and Theorems 3.5, 3.6 for anal- 
ysis of the limiting distribution of the larger parts. 

4. Two random constrained compositions and Markov chain approxima- 
tions. We focus on two interesting cases of such compositions, the column-convex- 
animals (cca) compositions and the Carlitz (C) compositions. 

4.1. Defintions and some basic facts, (a) A column-convex animal (cca) is a 
sequence of contiguous vertical segments of unit squares in W? , ordered from left to 
right, such that every two successive columns have a common boundary consisting 
of at least one vertical edge of W? . If the total number of unit squares involved is v 
then the lengths of the vertical segments form a composition of z/ ; we call it a cca 
composition. Let T{v, fi) denote the total number of the cca compositions of v with 
// parts; then T{i^) := ^ ^^ T{i',ii) is the total number of the cca compositions of 
v. Introduce f{w,z), the bivariate generating function (BGF) of T{iy,n), 



f{w,z) = Y^ T{u,fi) 



II ^v 



Louchard [19] found that 

wz{z-lf 
(4.1.1) h{w,z) 

h{w, z) := z^{w -1) + z^{w'^ -w + A) - z^{w + 6) + z{w -h 4) - 1. 

Therefore f{z) , the GF of T{u) , is 

z{z-lf (-z-l)^ 



(4.1.2) f{z) = f{l,z) 



h{l,z) 4z^ -7z^ + 5z-l' 



a formula discovered earlier by Klarner [15]. Privman and Forgacs [22] used (4.1.2), 
and Darboux theorem, to show that 

(4.1.3) Tiu) = ^{l + 0{Y)), 

where 7 < 1 , C = 0.18 . . . , and z* = 0.31 ... is the smallest-modulus solution of 
h{l,z) = 0. 

We get a uniformly random cca composition of z/ , if we assume that each com- 
position has the same probability, \/T{v). It was discovered in [19], [20] that the 
distribution of the last (first) part is asymptotic to 

vri(/c)=4(/c + a), 

(4.1.4) 1-z, 1 

a := = 0.75 . . . , 
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which is directly seen as a probabihty distribution. Besides, the joint distribution 
of two consecutive parts Yt and y^+i , with both t and Ai — t of order 0(z/) , was 
shown to be asymptotic to that of two consecutive states of an ergodic Markov 
chain on N, in a stationary regime, with transition probabihties 

(4.1.5) p^i^k) = zUi + k-l)'^, 

and a stationary distribution 

(4-1-6) A._\^^kn. , .^2_ ^* ,1-^* 



A:=Y.-t{k + af = -^-y^ + 



fc>i '^ ^*^ ^* 

That X]fc>iP(^'^) ~ -'- fohows from another formula for a, 

2zl 
a 



[I - 2z,){i - z^y 



(The given formulation is slightly different from, but equivalent to that in [19], [20].) 
One way to derive (4.14) is to use (4.1.3) and a formula for fk{z), the generating 
function of the cca compositions with the first (last) part equal k, 



(4.1.7) Mz) = z'' + z'f{z) 



k + 



z^ — z"^ + z 



which can be read out of [19]. Comparing the first line in (4.1.4) and (4.1.7) we 
must also have yet another formula for a, namely 

(4.1.8) ^^^-zl + z. 



z,r 



which is indeed the case. 



(b) A Carlitz (C) composition of u is defined as a composition such that every 
two consecutive parts are distinct from each other. The counterparts of the cited 
results for the cca compositions are as follows. Carlitz [8] proved that 

fiw,z)=-l ^ 



h{w, z) ' 

hiv.,z):=l-Y^i-iy^^^; 
i>i 

for \w\ < 1, h, as a function of z is analytic for |z| < 1, and for \w\ > 1, h is 
analytic for |z| < l/|t«| . Louchard and Prodinger [21] found a rather more tractable 
expression for h, namely 



(4.1.10) h{w,z) = l-J2Y 



i>i 



WZ-' 

+ wz^ 
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(4.1.9) and (4.1.10) were used in [20] to show that 

(4.1.11) T{v) = ^{l + 0{Y)), 

where 7 < 1 , C = 0.456 . . . , and z* = 0.57 ... is the smallest-modulus solution of 
h{l,z) = 0. 

We get a uniformly random C-composition of u, if we assume that each C- 
composition has the same probability, 1/T{v) . In a striking analogy with the 
random cca composition, the two consecutive parts Yt and Yt^i , deep inside the 
composition, are also jointly asymptotic to the two consecutive states of an ergodic 
Markov chain, with transition probabilities 

f fc l + < . ,, 

(4.1.12) Pii,k) = l '* l + z^ '^ ' 

[o, i = k. 

and a stationary distribution 

k k 

(4.1.13) ^{k) = A- \ ^* ,, , ^:=\"— -fi— -. 

And the limiting distribution of Yi is 

(4.1.14) 7ri(A;) 



Z^ 



l + zl' 
which follows from (4.1.11) and a counterpart of (4.1.7), 



yK-\- J- yfl, 



(4-1-15) A(^) = T^7^ + /(-)Y 



-jfc 



(That (4.1.12) and (4.1.14) and are indeed probability distributions follows from 
the definition of z* as a root of /i(l,z) = and (4.1.10).) 

For each of the compositions, an equation h{w, z) = (for the attendant function 
/i(w, z) ) determines a root z{w) , well defined for w sufficiently close to 1 , such 
that 2:(1) = Zh< , z{w) is infinitely different iable, and z'{l) < 0. The number of 
parts Ai for each of the random compositions was shown, in [19] and [21] resp., to 
be Gaussian in the limit z/ — t- 00 , with mean ai/ and variance /Si^ , where 

z'(l) z'(l) , z"(l) 
(4.1.16) a = f-f = ^, /3 = a^ + a ^. 



In particular, 

12z2 - I4z, + 5 



(4.1.17) a 



0.45 ... , (for cca) 



^* ~r ^* ^* ~r Z^ 

^ 7'>1 (T-U^^\2 

(!+-.)_ ^0.35..., (forC); 



needless to say, in each case z* is the root of the corresponding equation /i(l,z) = 0. 
In Appendix we will prove the following large deviation result. 
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Lemma 4.1.1. For each of the compositions, there exists an absolute constant 
c > such that 

P{\M - av\ >s}< cz/exp(-sV3/3j^), 

provided that s = o{v) . Thus 

(4.1.18) P{\M-au\<u^^'^\nv}>l-u-^, \f K > 0. 

Note. Borrowing a term from Knuth et al. [17], the event on the left of (4.1.18) 
happens quite surely (q.s.). 

4.2. Approximating the random compositions by the Markov chains. The 

results cited above strongly suggest, though not actually prove, that the random 
cca composition and the random C-composition considered as random processes are 
each asymptotic to its own Markov chain, defined in (4.1.4)-(4.1.5) and (4.1.11)- 
(4.1.13) respectively. 

The following theorem confirms this natural conjecture. 

Theorem 4.2.1. Let Y = {Yt}t>i be either the random cca composition, or the 
random C-composition of v. Let Z = {Z(t)}t>i be the corresponding Markov 
chain with the transition probabilities p{i,k), and Z{1) having the distribution 
{7ri(i)}i>i. Introduce 



M = max {l<m< M : Yi H ^Y^ < u -\n^ v) . 

M = max {m > 1 : Z{1) H h Z{m) <u -\d^ v]; 



in particular, Ai € {A4 — In z/ — 1,A4) . Let d and d denote the probability dis- 
tribution of {M, (Yi,...,!^)) and (M, (Z(l), . . . , Z(M)) respectively. For each 
chain, 

(4.2.2) \\d-d\\Tv = 0{u-^), VK>0. 

So, the random composition of v , read from left to right, is closely approximated 
by the corresponding Markov chain, as long as the accumulated sum of parts stays 
below 1/ — In v . (A restriction of this sort is unavoidable: like the first part, the 
last part of the random composition has the distribution vri , which differs from 
the stationary distribution vr.) Now, we will see that, with high probability, the 
extreme-valued parts are in this "bulk" of the composition, implying that they 
are well approximated by the extreme- valued states of the M-long segment of the 
corresponding Markov chain. It is easy to verify that 

■s-^ I Oiz^n^), cca chain, 

(4.2.3) sup5]p(.,fc)= : p . . 

* k^n yO{z^), C-cham, 

where z* = 0.31 . . . for the cca chain and z* = 0.57 . . . for the C-chain. That is, 
the chains meet the tightness condition (2.1). And the exponential mixing property 
in the form of (3.3) is easily verified as well. So we are able to use Corollary 3.4 
and Theorem 3.6, say, for derivation of the limiting distribution of those extreme 
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values, and then the last theorem for a quick proof of the corresponding results 
regarding extreme -valued parts of each of the random compositions. 

Turning the tables, we can also use Theorem 4.2.1 and Lemma 4.1.1 to determine 
the very likely bounds of M with sufficient accuracy. Since M € (A^— In iy—l,A4), 
Lemma 4.1.1 implies that q.s. 

\M -av\ < Iv^l'^X^v. 

So, applying Theorem 4.2.1, we immediately see that 

(4.2.4) \M -av\<2v^l'^\uv 

q.s. as well. (!) 

Proof of Theorem A:.1.\. The key element is the following claim. 

Lemma 4.2.2. Let Y he either the random cca composition or the random C- 
composition of v . Let k > 1, i = (ii, . . . ,ifc) G N^ , where ii + - ■ ■ + ik < i^ ■ Denote 
P^{i) = P{Yi=i^,...,Yk=ik} and P{i) = P{Z{l) = ii,...,Z{k)=ik}. Then, 
uniformly for k and i, 



(4.2.5) P,(i) = P(i)exp(o(fc7"-l'l)), |i| = ii + 

where 7 comes from either (4.1-3) or (4-1.10). 



Ik, 



Proof of Lemma 4.2.2. Let Y be the random cca composition of v . We will 
prove (4.2.5) by induction on k. 
For k = 1, 



(4.2.6) 



PAii) 



[^1/nW 



[^1 f{z) ' 
where f[z) and fi^ {z) are given by (4.1.2) and (4.1.7) respectively. Here, by (4.1.3), 

(4.2.7) [z^]f{z)=T{v) = ^e^v{0{r)). 

Further, by (4.1.7), 



[z'']f\,{z)=5,,,, + [z^-'^]f{z) 



ii + 



z^ — z"^ + z 



[l-z] 



:t,T{u~i^) + [z^-^-]f{z) 



z"^ - z^ + z 



(4.2.8) 



:H^exp(O(7-0) + ^4r:%^exp(O(7-0)- 

Zjf Zif \^ <- j 



(z* is the smallest modulus pole of f[z){z^ — z^ + z){l — z) ^ , as well.) It follows 
from (4.2.6)-(4.2.8) and (4.1.8) that 



Pu{ii)=z^ 



^^ Z^ \ -2^* 



H 



exp(O(7'^-^0) 



{l-z.f 
z:^(zi+a)exp(O(7^-*0)=^(^i)e^P(O(7""^0)' 
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which is (4.2.5) for k = 1. 

Suppose that (4.2.5) holds for some A; > 1. Let i = (ii, . . . ,ik+i) be such that 
|i| < u. Let i' = {i2, ■ ■ ■ ,ik+i)', then |i'| < v — ii. Let T(i, z^) and T(i',z^ — ii) 
denote the total number of the cca of area v [u — ii resp.) with the first /c + 1 
parts ii, . . . , ik+i (the first k parts ^2) • • • ) ^fc+i resp.). By the definition of the cca 
composition, 

r(i,z/) = (ii + i2-i)T(i',z/-ii). 

Therefore 



P.{^) 



[z-]/,,(z) T{i^-h) {i,+i^-l)T{i',u-h) 



ii +a 
ii + a V 

= p(zo !i±ipl p(i') exp (o {{k + 1) 7 ^-'^i; 

and we observe that 

k 
^ ^(l ) = ^ ^(^2) [[P[ir, V+l) 

zi + a ii+a -^-^ 

r=2 

=p{k,i2) JJp(v, V+i) = np(v, V+i) 

r=2 r=l 



P,(i) = P(ii) n p(v, v+i) exp (0((fe + 1)7 ^-1^ 



Hence 



r=l 

= P(i)exp(o((fe + l)7^-l^l))' 

which completes the inductive proof of (4.2.5) for the random cca composition. The 
proof for the random C-composition is similar, and we omit it. □ 

Lemma 4.2.2 implies the bound (4.2.2) of Theorem 4.2.1 without much difficulty. 
Consider, for instance, the random cca composition of u . Let m, i = (ii, . . . ,im) 
be given. Clearly 

P{M >m,Yi = ii,...,Ym = im} 

= P{M > m, Z{1) = ii, . . . , Z{m) = im] = 0, 

unless |i| < z^ — In v . In the latter case m <v — h\ u , and, by Lemma 4.2.2, 

P{7W>m, yi=n,...,y^ = i„} = P(i)exp(o(m7^-l'l; 

=P{M>m, Z{l) = H,...,Z{m) = i^jexp fo(z/ 7'"' ^ 
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uniformly for m and i in question. Consequently, uniformly for all m and S C N" 

P{M>m, {Yi, . . . ,Y^) e B} 
=P{M > m, (Z(l), . . . , Z{m) € B} exp (0(7°-^'"' '') 

whence 
(4.2.9) 



P{M = m, {Yi,...,Y^)£B} 
--P{M = m, (Z(l),...,Z(m)) e B} + Oh^'^^''^ •") 



Let D C N"^"*""^ be given. For zGN^, k < v + l, we write z G D if z is a projection 
of a point in D on the first k coordinates. Noticing that M < i^ and M < z^ , we 
obtain from (4.2.8): uniformly for all D G N'^'^^ , 

p[{M,{Yi,...,Y^))€d} 

=P{(M, (Z(1),...,Z(M)))gZ?} + 0(7°-^1"'^). 

This completes the proof of Theorem 4.2.1. D 

4.3. Limiting distributions of the extreme parts of the random compo- 
sitions. By (4.2.3), for each of the two chains, q.s. 

(4.3.1) Ni + 1<M<1 + N2, 7Vi,2 = [aJ^±2z/^/2lnz/J. 

So q.s. the extreme values of {Z{t)}Q^^^j^ are sandwiched between those of 
{Z{t)}o<^t<Nj+i and {Z{t)}o<t<N2+i- Picking a generic N G [Ni,N2], introduce 
{X(f)}o<t<Ar = {Z(t)}o<t<Af+i • Here X{0) has distribution 7ri(-). 

Let X(^) be the ^-th largest among X{t), t G [l,iV], for X(0) = i, i G N. By 
Corollary 3.4, 

(4.3.2) Pi{X(^) <n} = PJPoisson (7V^(5„)) < ^l} + 0[^? /N + Np'iSn)), 

where 

7r(S'„) = ^7r(A;), p(S'„) = sup ^p(z, /c). 

k>n k>n 

Here p{Sn) = 0{n'^z'^) for the cca chain, and p{Sn) = 0{z'^) for the C-chain, see 
(4.2.3). (Again, z* = 0.31 . . . for the cca chain, and z^ = 0.57 . . . for the C-chain.) 
Turn to 7r(5n) . For the cca chain, by (4.1.6), 

(4.3.3) ^(5.) = n2z:i?(l + 0(n-i)), B := f^~^*^\, . 

z^. -\- \i z^, ) 

For the C-chain, by (4.1.13), 

(4.3.4) 7T{Sn) = Bz:{l + Oiz:)), B:=A-^-^^. 
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Lemma 4.3.1 (cca chain). Suppose that 

4.3.5 n = -^ -^—4 ^ G N, 

where A = o(lnN) . If ^ = o(ln A'^) , then, uniformly for i G N, 

(4.3.6) P,{X'^^'^ <n} = P{Poisson (A) < ^} + 0[{\ + ^i) / \n N] . 

Equivalently, define Wj^^^ by 

ln(l/z*) 
then, for s = o(ln N) such that 

(4.3.8) -^ -^— ^ ^ G N, 

ln(l/z*) 

we have 

(4.3.9) P.{W^iv,M > s} = P{W^ >s} + 0[{s + fi)/lnN] ; 

/lere W^ = Vi + ■ ■ ■ + V^ , and Vi , . . . , V^ are independent exponentials with unit 
mean. 

This Lemma implies the following cruder result. (We use a symbol Op{l) to 
denote a random variable bounded in probability as N —?■ oo.) 

Corollary 4.3.2. If fi = o(ln A'') , then, uniformly for i G N, 

, , Inf/i-iTVln^AT) 

(4.3.10) x<^. = J^i__J + o,(l). 

Here are the counterparts for the chain associated with the random C-compo 
sition. 

Lemma 4.3.3 (C-chain). Suppose that 

4.3.11 n= ; / G N, 

ln(l/2;^,) 

where A = o{N^''^) . If fi = o{N^''^) , then, uniformly for i G N, 

(4.3.12) Pi{X'^^'^ <n} = P{Poisson (A) < n} + 0[{X^ + n^)/N]. 
Equivalently, define Wn,^ by 

(4.3.13) X('^) = \ ^'^ ^ 

ln(l/z*) 

then, for s = o{N^'^) such that 

4.3.14 \ / G N, 

ln(l/z*) 

we have 

mWN,^, >s} = P{W^ >s} + [{s^ + fi^)/N] . 
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Corollary 4.3.4 (C-chain). If fJ, = o{N^^'^) , then, uniformly for i € N, 



(4.3.15) 



-""-*i|^-°''"^ 



Proof of Lemma 4.3.1 and Corollary 4.3.2. (a) By (4.3.3), (4.3.5) and (4.2.3), 

iV7r(5„) = A + 0(A/lniV), Np\S^) = 0[N-\NT:{S^)f] = 0{\^/N). 

Then, for j < fi, 

{N7T{Sn)y = X' {I + 0{i2/ In N)). 

So, by Corollary 3.4, (3.33), and (4.2.3), (4.3.3), 



J<M 



J! 



A^' 






(b) Given s > , 

ln[s-iBiV(lnn/lnz,; 



ln(l/z*) 



ln[s7^SiV(lnn/lnz*) 
ln(l/z*) 



where si G [s, sz^ ^). Using the definition of VFat,^ in (4.3.7) and the asymptotic 
formula (4.3.9) we obtain then: for s = o(lnN), 

(4.3.16) P {z,W^ > s} + 0[{s + fi)/ In N] <P^{WN,^,>s} 

< P{W^ >s} + 0[is + fi)/lnN]. 

For iJ. fixed, (4.3.16) implies that 

lim liminf P.lVFw,^ G [A~\A]} = 1, 

i. e. , in probability, Wn^^ is bounded away from zero and infinity, whence 
lnPF7V,/i = Op{l) . Suppose /i — )■ cxo. Then {W^ — fi)/fi^^'^ is asymptotically 
normal, with zero mean and unit variance. Consequently 



(4.3.17) 



lnW^ = ln^ + Op(l). 



Let y = y{N) — > oo so slow that s = iie^ = o{\nN) as well. Using the right hand 
side of (4.3.16), we obtain 

P^{\nWN,^. >\n^x + y}=P,{WN,^. > ^e^ 

= P{Wf,> fiey} + 0{is + fi)/ In N) 
(4.3.18) =P{lnW^ >lnn + y} + 0{{s + ^i)/lnN) = o(l). 
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Analogously, the left hand side of (4.3.16) delivers 
(4.3.19) lim Pi{lnWN,n>lnfi-y} = 0. 

The relations (4.3.17)-(4.3.19), together with (4.3.7) prove (4.3.10). D 

The proof of Lemma 4.3.3 and Corollary 4.3.4 is similar and we omit it. 

Recall that N € [iVi,iV2], A^i,2 = [aiy ±2i^'^/^lni^\ . Introduce iVo = [ai^\ ■ It is 
easy to check that the proof of Lemma 4.3.1 and Corollary 4.3.2 goes through with 
very minor changes if, instead of (4.3.5), we define an integer n by 



n 



ln[X-^BNo{lnNo/lnz^] 



ln(l/z*) 
(The key is that 

N(lnNf = {l + OiN-^/^lnNo))No{lnNof, 

uniformly for A^ in question.) The same change can be made in the formulation of 
Lemma 4.3.3 and Corollary 4.3.4 for the C-chain. This observation coupled with 
the fact that X}^ , the ^u-th largest value of {A(t)}Q^j^j^^, is sandwiched between 
those for {A(t)}o<f<Ar^ and {A(t)}o<t<Ar2 , show that in Lemma 4.3.1, Corollary 
4.3.2, Lemma 4.3.3 and Corollary 4.3.4 we can put X^ instead of X^^^' . Below, 
by the relations (4.3.6) and (4.3.12) we will mean their modifications, i.e. with 
X^''^ on their LHS. 

Turn to A^^) , the fi-th largest value among X{0),X{1), . . . X{M) . X{0) has 
the distribution vri given by either by (4.1.4) or by (4.1.14). Hence 

{0(nz"'), for cca. 



XM <^(a*) <x{0) + X^,^^- 



Now 

so, for the cca case, we use n defined by (4.3.5) and add an extra error term coming 
from (4.3.20), i.e. 

to the RHS of (4.3.6), to obtain the corresponding claim for X^^' . Likewise, in the 
C-case we need to add an error term 0(i/~^/^) to the RHS of (4.3.12). Again, we 
will refer to these new relations as (4.3.6) and (4.3.12). 

But then, according to Theorem 4.2.1, the //-th largest among the parts Yi, 
. . . , Yj^ of the corresponding random composition can replace X^^^' on the LHS of 
(4.3.6) and (4.3.12) respectively. These are our newest (4.3.6) and (4.3.12). 

Finally, if we include the rightmost parts ^j(4-i-iJ^M+2^ . . . , it will not substan- 
tially affect the the limiting behavior of the //-th largest overall part either. Here 
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is why. The number of these parts is m := [In z/] , at most. The total number of 
parts is q.s. of order ly ^ m, which means the last m parts are q.s. well defined. 
Those parts, read from right to left, and the first m parts, read from left to right, 
are equidistributed. By Theorem 4.2.1, these m first parts are within the total 
variation distance 0{v~^) , {y K > 0), from Z{1), . . . , Z{m) . We know that Z{1) 
has the distribution tti . Since 

sup y]p*(i,/i;) < supy]p(i,/i;), t > 1, 

we see that 

P{Z{t)>n} <supy^p{i,k), t>2. 

In view of (4.1.4) and (4.2.3), we obtain then: for the cca chain. 



InlX-'^BNoilnNo/lnz., 



|2] j f ln(t/ln^ I') 

4 ,A^^ln(l/^.) 



P { max Z(t) > -^ '. ''' *-^ }=0{ (In" u)z.. 

[i<t<m ln(l/z*) J \ / 

For the C-chain, the analogous probability is of order v~^'^\n u. Therefore, by 
adding yet another error terms 0{v~^hi u) and 0(z^~^'^ln v) to the RHS of 
(4.3.6) and (4.3.12) (where A'^ = A^o = L'^^J ■> ^^ course), we obtain the limiting 
distributions of the /i-th largest part of both random compositions, together with 
explicit error terms. (For the cca composition, the order of the total error term 
remains unchanged, i.e. 0((A + ;u)/ In i^) . 

In summary, we have proved the following. 

Theorem 4.3.5. For a random composition Y of u , let y(^) denote the [i-th 
largest part. Let Nq = [ai^J , a being defined in (4-1-17). Let IV^ be the sum of 
/i independent exponentials with unit mean, (i) For the random cca composition, 
define VF,y,^ by 

y(M) = ln[iy-^i3JVo(lnJVo/lnz,)2] 



ln(l/z*) 
B being defined in (4-3.3). Then, for s = o{lniy) such that 

ln[s-^BNo{lnNo/lnz,)^] ^^ 

— -^ G N 

ln(l/z.) 

we have 

P{W^,t. >s} = P{W^, >s} + 0[{s + fi)/ In v] . 
(ii) For the random C-composition, define W,j^^ by 

yi,) ^ HW.-:,BNo] 
ln(l/z*) ' 
B being defined in (4-3.4). Then, for s = o(j/^'^) such that 

In(s-^BNo) 
ln(l/z*) ^ ' 
we have 

P{W,,^. >s} = P{W^ >s} + 0(z.-i/2 in2 „ + {s^ + ,i^)/v). 

Here is a cruder estimate implied by Theorem 4.3.5. 
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Corollary 4.3.6. (i) For the random cca composition, 

''"•'- Ml/..) +°''"' (M = o(l...)). 
(ii) For the random C- composition, 



1^ 



Y^^^= -Ty^J +Opil), {fi = o{u'/')). 



(iii) So, for both cases. 



ln(l/z*) 



if ii = o{\d.u) and ii = o{iy^'^) respectively. 
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Appendix. 

Proof of Lemma 4.1.1. Consider the case of the random C-coniposition. The 
BGF of T(^, u) , the number of C-compositions of v with ^ parts, and v and ji 
marked by z and w respectively, is given by (4.1.9)-(4.1.10): 

f{w,z) = — 1 + -, h{w,z) = 1 — N 7. 

h{w,z) ^ 1 -)- vjz^ 

This bivariate series converges for \z\ < 1 and \w\ < l/\z\. So, choosing ri < 1, 
and r2 < 1/ri , we have 



T{u) 
11// f{w,z) 



T{u) (27ri)2 7 / 2!^ + li(;M+l 

zeCi weC2 



dwdz, 



where Ci , C2 are circles of radius ri and r2 respectively. In essence, it is this 
formula that, via Bender's method [5], enabled Louchard [19] and Louchard and 
Prodinger [21] to establish a sharp local limit theorem for Ai for the cca compo- 
sition and the C-composition. Since our goal is to bound the probability of large 
deviations, we use a considerably less analytical argument, which is a bivariate 
extension of Chernoff 's method. 

As a preparation, we need to define a differentiable extension of z* = 0.57 . . . , 
the smallest-module root of h{l,z) = 0. To this end, we compute 

So hz{l,z) < 0, hw{l,z) < for z G (0,1). By continuity of hz{z,w), hw{z,w), 
we obtain: there exists e G (0, 1 — z*) such that (1) (z^, + e)(l -|- e) < 1, and (2) 

(A.2) hziz, w) < 0, hwiz, w) <0, V {z, w) eRl : z < z^ -\- e, \w - 1\ < e. 
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Consequently, for |w — 1| < e , the equation h{z, w) = has a unique root z = z{w) , 
of multiplicity 1 , in [0, z* + e] , which is infinitely differentiable as a function of w , 
and z{l) = z* • In particular, 

, hy,{z{w),w) 

h^{z{w),w) 

that is z{w) is strictly decreasing. So z{w) > z* for ti; < 1, and z{w) < z* for 
ti; > 1. 

Now, the series for the bivariate generating function f{w, z) converges for \w — 
1| < e and \z\ < z{w) . Since all the coefficients in the series are nonnegative, 

Y, [z'w'] f{w, z) < l^^, u; G [1, 1 + eo], ^ G (0, z{w)). 

Likewise 

Y,[z''w']fiw,z) < l^^, w G [1 -eo,l], z G (0,zH). 

Here, by the definition of f{w, z) and z{w) , 

Q 

f{w,z) < —— , z<z{w). 

z{w) — z 

Therefore, for each m, 

(A.3) P{M>m)<c ^^4>) ^' ^e[l'l+^o], ^€(0,zH), 

and 

(A.4) P{M <m)< /""^""'^^H - z)-' ^ ^^[i_^^^i]^ ^^(o^^(^)). 

Consider (A.3). To get the most out of this upper bound we need to determine z 
and w that minimize the RHS, i. e. 

H^'^'{w,z) := —ulnz — mlnw — ln{z{w) — z). 

Let us find a stationary point {w,z) of H^^>{w,z) in the region w £ [1,1 + eo] > 
z G (0, z{w)) . From the equations 



we obtain that 



z 


V i 

z z(w) — z 


= 0, 


<m) _ 


m z'{w) 


= 


w 


w z{w) — z 




._ ^ .f...^ 
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where w = w{m) must be a root of 

(A.5) ^"'(^) 



m 



z{w) V + 1 

The equation (A.5) has a solution tt; = 1 if 

m = rh := {u + \) pi, fi := 

Furthermore, in [19] it was shown that 

d wz'{w) 



z{l) 



i/; = l 



z{l) 



dw z{w) 

is negative; this is — /3, /3 defined in (4.1.16). Since 

d f ra \ 1 



^-M 



dm V i^ + 1 



y + l 



<0 



as well, for 



< m — m = o{u), 



the equation (A.5) defines a strictly increasing w{m); so w{m) > 1 for m > m. 
More precisely 

R 

w{m) = 1 H {m — fh) + 0{{m — fh) jv ) 

/5, 



1 + -{m- fiu) +0{{m- fii^f/i^^). 



Now 



H^"^\w{m),z{w{m))) = -z/ln ( z.. 



In 



Also 



z^ + 1 7 Vi^ + l 

z^lnz^ +lnz/ + 0(l). 



-^ii'('")(^(H,^("i)) 

--H^\w{m),z{m)) + H^^'^{w{m),z{m)) + Hi"'\w{m),z{m)) 
-H^'{w{'m),z{'m)) = —lnw{m), 



which implies that 



dm 



H^'^\w{m),z{m)) 



Inw(m-) = 0, 
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and also that 



dm"^ 



H^"'\w{m),z{m)) 



w'{m) 



w[m) 
w'{fh) 

z^ + 1' 



Therefore, for < m — m = o(i^) , 

H'^'^\w{m),z{m)) = -ulnz^ + Inz/ - (1 + o(l))— — — -{m - rhf + 0{1) 

< — fXnz^ +\nu {m ~ jjlu) +0(1). 

Using this bound in (A. 3) for w = w{m) , z = z{w{m)) , and recalhng that T(i/) is 
of order z~'^ , we obtain: 



P(A4 > m) < cz^exp ( {m — jiv) ), < m — fiu = o{i') . 



Likewise 



P{Ai < m) < cv exp { [m — j-iu) ), ^ < jiv — m = o{v). 



The case of the random cca composition is quite analogous, so we omit the 
proof. D 
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